Investigating text normalization and pronunciation variants for German broadcast transcription

نویسندگان

  • Martine Adda-Decker
  • Gilles Adda
  • Lori Lamel
چکیده

In this paper we describe our ongoing work concerning lexical modeling in the LIMSI broadcast transcription system for German. Lexical decomposition is investigated with a twofold goal: lexical coverage optimization and improved letter-to-sound conversion. A set of about 450 decompounding rules, developed using statistics from a 300M word corpus, reduces the OOV rate from 4.5% to 4.0% on a 30k development text set. Adding partial inflection stripping, the OOV rate drops to 2.9%. For letterto-sound conversion, decompounding reduces cross-lexeme ambiguities and thus contributes to more consistent pronunciation dictionaries. Another point of interest concerns reduced pronunciation modeling. Word error rates, measured on 1.3 hours of ARTE TV broadcast, vary between 18 and 24% depending on the show and the system configuration. Our experiments indicate that using reduced pronunciations slightly decreases word error rates.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Regional Variants of German: Categories of Pronunciation Deviation from Standard German

This analysis describes categories of pronunciation variants we found in the transcription of monologues recorded for the RVG1 corpus (Regional Variants of German). Our results indicate that transcriptions on orthographic level provide useful information on regional variations of standard German. The pronunciation variants can be categorized into assimilation, enclitics, and types of single pho...

متن کامل

The 300k LIMSI German broadcast news transcription system

This paper describes improvements to the existing LIMSI German broadcast news transcription system, especially its extension from a 65k vocabulary to 300k words. Automatic speech recognition for German is more problematic than for a language such as English in that the inflectional morphology of German and its highly generative process of compounding lead to many more out of vocabulary words fo...

متن کامل

The bell labs German text-to-speech system: an overview

In this paper we present an overview of the German version of the Bell Labs text-to-speech system, a high-quality concatenative synthesis system with extensive text analysis capabilities. We discuss problems of text analysis, and our solutions to these problems, including: the integration of text normalization tasks into linguistic text analysis; the capability to morphologically analyze compou...

متن کامل

Generating proper name pro for automatic speech

Generating correct pronunciation of proper names remains one of the most difficult tasks in text-to-phoneme transcription. Although phonetic rules can be efficient in processing proper names of one language, foreign family names cannot be always correctly generated without additional pronunciation rules. The present study addresses the problem of pronunciation variants for French and foreign fa...

متن کامل

RVG 1 - A Database for Regional Variants of Contemporary German

Regional speaker variability is a major problem in today's stateof-the-art speech recognition systems. Therefore, a major point in the creation of speech resources is the regional coverage of data within one language. At the beginning of 1996 we started to collect data for the RVG1 (Regional Variants of German) corpus. This project was established in cooperation between the American telephone c...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000